Visual units and confusion modelling for automatic lip-reading
نویسندگان
چکیده
منابع مشابه
Confusion modelling for automated lip-reading usingweighted finite-state transducers
Automated lip-reading involves recognising speech from only the visual signal. The accuracy of current state-ofthe-art lip-reading systems is significantly lower than that obtained by acoustic speech recognisers. These poor results are most likely due to the lack of information about speech production that is available in the visual signal: for example, it is impossible to discriminate voiced a...
متن کاملVisual Words for Automatic Lip-Reading
.................................................................................. i ACKNOWLEDGMENT.................................................................... iv ABBREVIATIONS.......................................................................... v CONTENTS................................................................................... viii LIST OF FIGURES...........................
متن کاملAutomatic Lip Contour Tracking and Visual Character Recognition for Computerized Lip Reading
Computerized lip reading has been one of the most actively researched areas of computer vision in recent past because of its crime fighting potential and invariance to acoustic environment. However, several factors like fast speech, bad pronunciation, poor illumination, movement of face, moustaches and beards make lip reading difficult. In present work, we propose a solution for automatic lip c...
متن کاملA System for Automatic Lip Reading
In this paper, we present our approach of face and lip detection, lip modeling, and tracking. A new lip model based on Bézier Curves is used to capture the dynamics of the lips efficiently. The model is defined only through few points which are modeled using the Active Shape Model (ASM). Accurate detection of lip details is implemented using multiple independent feature templates. The method de...
متن کاملLearning Visual Models for Lip Reading
This chapter describes learning techniques that are the basis of a "visual speech recognition" or "lipreading" system 1 • Model-based vision systems currently have the best performance for many visual recognition tasks. For geometrically simple domains, models can sometimes be constructed by hand using CAD-like tools. Such models are difficult and expensive to construct, however, and are inadeq...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Image and Vision Computing
سال: 2016
ISSN: 0262-8856
DOI: 10.1016/j.imavis.2016.03.003